PALETTE-BASED HISTOGRAM MATCHING 
WITH RECURSIVE HISTOGRAM VECTOR GENERATION 

BACKGROUND OF THE INVENTION 

5 1 . Field of the Invention 

This invention relates to the field of image processing and pattern matching, and in 
particular to the comparison of two datasets based on a histogram characterization of each of the 
two datasets. 

10 2. Description of Related Art 
^ Histograms are commonly used to characterize sets of data. Similarities between sets of 

data can often be determined by comparing the relative frequency of occurrence of common 

UJ 

yi values in each of the sets of data. Pattern matching, and in particular image matching, often 

p includes histogram comparisons to determine similarities between images, or to search for target 

objects within larger images. 
3 A histogram comprises 'classes', or 'bins', for containing a frequency count of the number 

Q 

of members of the class within the dataset being characterized. Each data item is associated with 
[7 a class, based on its value, and the occurrence of each data value in the dataset causes the 
0 frequency count of the associated class to be incremented. In some histogram structures, each 
"20 data item may be associated with multiple classes, and the frequency count of each class is 

incremented for each occurrence of the data item. For ease of reference herein, each data value is 
assumed to be associated with a single class, without loss of generality, because the principles of 
this invention are applicable regardless of the techniques used to assign frequency counts to 
classes. 

25 The number of classes used to classify the different data values of a dataset generally 

determines the discriminating capability, or resolution, of the histogram. If many different data 
values are assigned to the same histogram class, the histograms of datasets having substantially 
different data values may be equivalent, and therefore indistinguishable. Conversely, if each 
different data value is assigned to a different histogram class, the histograms of datasets having 

30 very similar, but not exactly the same, data values will be substantially different, even though the 
datasets are similar. Generally, the number of classes used to classify the different data values is 
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selected to provide a good distinguishing capability among substantially different datasets while 
also providing similar or equivalent histograms for substantially similar datasets. 

In a conventional histogram comparison process, the time required to provide a measure 
of similarity between histograms is proportional to the number of classes in the histograms, and 
the time to compute the histogram of an area is proportional to the size of the area. A histogram 
structure that has good distinguishing capability (i.e. a large number of classes) may result in an 
excessively long processing time. In some applications, such as real-time video processing, the 
speed of processing is crucial to the feasibility of the application, and compromises that reduce 
the distinguishing capabilities of the histogram are often imposed. 

A common application in the field of image processing is a search of a relatively large 
image for the location of a relatively small target image. In such an application, each region of 
the larger image that could possibly contain the smaller target image is assessed to determine 
whether the content of the region corresponds to the target content. In the context of this 
application, a histogram is created for the content of the target, and this target histogram is 
compared to a histogram corresponding to each of the possible target-containing regions in the 
larger image. Consider an image that is N by M pixels in size, and a target that is n by m pixels 
in size. The image comprises (N-(n-l))x(M-(m-l)) regions that could contain the target. In a 
typical application, an image may be 1024x1024 pixels in size, and a target may be less than 
100x100 pixels in size. In this example, the image comprises almost a million possible regions, 
and a histogram is required for each of these regions. This potentially large number of regions to 
evaluate further necessitates an efficient histogram creation and histogram comparison process, 
particularly for real-time image processing. 

BRIEF SUMMARY OF THE INVENTION 
It is an object of this invention to provide a histogram comparison process that allows for 
sufficient distinguishing capabilities among histograms without requiring excessive processing 
time. It is a further object of this invention to provide a histogram comparison process having a 
processing time that is substantially independent of the number of classes in the histogram. It is a 
further object of this invention to provide a histogram comparison process that is independent of 
the size of a target. It is a further object of this invention to provide a histogram comparison 
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process that is particularly efficient for assessing histograms corresponding to overlapping 
regions in a larger dataset. 

These objects and others are achieved by providing a histogram comparison process that 
compares a histogram-vector of a first dataset to a corresponding histogram-vector of a second 
dataset. The histogram-vectors of the first and second dataset are formed from the histogram 
values of a select few of the histogram classes, the select few classes being the classes of the first 
dataset that contain the highest frequency count. The second dataset is characterized using only 
the select few classes of the first dataset; data items with values that belong to other classes are 
ignored. A palette dataset is created, corresponding to the second dataset, wherein the data items 
in the third dataset correspond to indexes to the select classes of the first dataset, or null indexes 
for the ignored data items. The histograms corresponding to overlapping regions is determined 
recursively, based on the third dataset. Given a histogram corresponding to a first region, the 
histogram of a second, overlapping, region is determined by merely incrementing the frequency 
count of the indexes of the elements that are in the second region, but not the first, and 
decrementing the frequency count of the indexes of the elements that are in the first region, but 
not the second. 

BRIEF DESCRIPTION OF THE DRAWINGS 
The invention is explained in further detail, and by way of example, with reference to the 
accompanying drawings wherein: 

FIG. 1 illustrates an example array of values, corresponding to an example search array. 
FIG. 2 illustrates an example target array of values, and the formation of a target histogram 
vector in accordance with this invention. 

FIG. 3 illustrates an example flow diagram of an array comparison process in accordance with 
this invention. 

FIG. 4 illustrates an example palette array corresponding to occurrences of target palette values 
in the search array, in accordance with this invention. 

FIGs. 5 A and 5B illustrate an example recursive determination of search histogram vectors for 
overlapping regions of the search array in accordance with this invention. 

Throughout the drawings, the same reference numerals indicate similar or corresponding 
features or functions. 
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DETAILED DESCRIPTION OF THE INVENTION 
This invention provides for a histogram-based comparison of two datasets, and is 
particularly well suited for locating a first dataset, or an approximation of the first dataset, within 
one or more regions of a larger dataset, or within one or more datasets within a larger collection 
of datasets. For ease of reference, the first dataset is termed a target dataset, or target. In the 
context of image processing, the target may be an image of a person. The histogram-based 
comparison process of this invention can be used to find the image of the person within a larger 
image of a group of people, or to find the image of the person among a collection of different 
images of people, or a combination of the two. Although this invention is presented using images 
as a paradigm dataset, it is applicable to the comparison of any sets of data based on the 
frequency of occurrences of classes of values (i.e. histograms), as would be evident to one of 
ordinary skill in the art in view of this disclosure. 

FIG. 1 illustrates an example array of values 100, corresponding to a dataset that is to be 
compared to a target array of values (210 in FIG. 2). For ease of reference and illustration, the 
invention is presented in the context of finding a smaller target dataset within a larger search 
dataset, although, as noted above, the invention is applicable for finding a target dataset within a 
collection of other datasets, and each search dataset may be the same size as the target dataset. 
Although the search dataset is preferably at least as large as the target dataset, one of ordinary 
skill in the art will recognize that the comparison can be structured to find one or more smaller 
portions of the target in a smaller search dataset. Additionally, the datasets need not be two- 
dimensional arrays of values; single dimensional or multidimensional arrays are also comparable 
using the principles of this invention. 

Each element 101 of the search array 100 is illustrated as having a data value within the 
range of a-z. In the context of image processing, these data values may correspond to a color 
value, a luminance value, a texture value, etc., or a combination of these values. For example, 
copending U.S. patent application "OBJECT TRACKING BASED ON COLOR 

DISTRIBUTION", serial number , filed for Miroslav Trajkovic, Attorney 

Docket US010238, and incorporated by reference herein, discloses the use of a composite data 
value is determined by a chromatic component if the data item (a pixel of an image) is 
distinguishable from gray, and a brightness component if the data item is gray, or near gray. 
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Furthermore, the chromatic component is a combination of the measures of hue (color) and 
saturation (whiteness) of each data item. In other contexts, such as demographics, the data values 
may correspond to classifications of economics, political affiliation, and so on. 

FIG. 2 illustrates an example target array of values 210, using the same set of possible 
5 data values (a-z) as the search array 100. That is, in the context of image processing, the letter 'a* 
in the target array 210 corresponds to the same color value, or composite value, as the data items 
in the search image 100 that also have the letter 'a', and vice versa. In general, the same process 
is used to determine the values of the data items in both the target and the search array. 

In general, a target dataset will contain fewer different values than the number of possible 
10 data values. That is, for example, few, if any, images will contain all possible color values. In the 
example of FIG. 2, from the range of 26 possible values, or histogram bins, of a-z, the target 
Ja contains nine different values: f, a, j, b, s, d, p, g, and h. The contents of histogram bins of each 

*I of these nine values 220 corresponding to the target image is illustrated at 230. As illustrated, the 

y i 

Jp data value of 'a 1 occurs in the target four times; the value 's 1 occurs three times; the values 'd' and 
rfj5 'f occur twice, and the values 'g\ 'h', 'j', 'b\ and 'p' each occur once. All other histogram bins in a 
^ histogram representation of the target will contain a frequency count of zero. 
rz In accordance with this invention, the "N" most frequently occurring data values in the 

M. target are identified. In this example, N is chosen to be seven, and the seven most frequently 
□ occurring data values are identified as: a, s, d, f, g, h, and j. Tied values are selected arbitrarily 
20 for inclusion among the N values. That is, the letters g, h, j, b, and p each occur once; any three 
of these letters may be selected to complete the selection of seven data values. 

Because this invention is particularly well suited for image processing and the matching 
of images based on histograms of color values, the collection of the N most frequently occurring 
data values in a target dataset is herein termed the target palette. In the example of FIG. 2, the 
25 target palette includes the values a, s, d, f, g, h, and j, with sequential indices 1-7 assigned to 
each, as indicated by the column 240. These indices 240 are used to form a vector 250, 
hereinafter termed the target histogram vector, or target vector, that indicates the number of each 
palette value in the target. That is, the first element, or dimension, of the vector corresponds to 
the palette value of 'a' (palette index T), and the value of the first element is four, thereby 
30 indicating that the target contains four occurrences of the value 'a 1 . In like manner, the second 
element indicates that the target contains three occurrences of the value V (palette index '2'), the 
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third element indicates that the target contains two occurrences of the value 'd' (palette index '3), 
and so on. Note that, because the target palette and vector are limited to a size of N (7), some of 
the values (specifically b and p) that are contained in the histogram 230 corresponding to the 
target are not represented in the target palette, nor in the target vector. As will be presented 
5 hereinafter, the speed of comparing datasets using the principles of this invention is substantially 
dependent upon N, as is the ability to distinguish among similar datasets. The choice of N is 
dependent upon the particular application, and an appropriate tradeoff between speed and 
performance. In an image processing application, the value of N is typically seven or fifteen, for 
compatibility with common byte or word sizes of eight and sixteen bits. 

10 

In accordance with this invention, the comparison of the target to another dataset, or 
^0 subset of a dataset, is effected by comparing the target histogram vector with a histogram vector 
[jj that corresponds to the other dataset, or subset. The comparison of the target histogram vector to 
£ a histogram vector of another dataset or subset is facilitated by the use of a "palette array" that 
|U5 corresponds to the occurrences of the target palette values in the search array. In the example of 
\ the search array 1 00, each element 1 0 1 of the array that contains one of the palette values of a, s, 
^ d, f, g, h, or j, is identified in the palette array by the palette index number, 1, 2, 3, 4, 5, 6, or 7, 

respectively. All other elements in the palette array are set to zero, or null, to indicate that a 
m palette value is not contained at the corresponding element of the search array. FIG. 4 illustrates 
20 a palette array 400 that corresponds to the example search array 100 of FIG. 1, and is discussed 

further hereinafter. 



FIG. 3 illustrates an example flow diagram of a dataset comparison process in 
accordance with this invention. At 310, the target histogram is determined by counting the 
number of occurrences of each data value in the target dataset. A mapping may be effected to 
map, or quantize, particular values into a data value corresponding to a class, or bin, of the 
defined histogram. For example, if the histogram classes correspond to a partitioning of a 
continuous range of values into integer values, a rounding or truncation process will be applied 
to identify an integer data value corresponding to the appropriate histogram class, or bin. 

A target palette is formed, as presented above, by identifying the N most frequently 
occurring data values (histogram classes) within the target dataset, at 320. The target palette is 
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used to form the target histogram vector, at 330, as presented above, and to create a palette array 
corresponding to the occurrences of target palette values in the search dataset, at 340. 

FIG. 4 illustrates an example palette array 400 corresponding to occurrences of the 
example target palette values (a, s, d, f, g, h, and j) of FIG. 2 within the search array 100 of FIG. 
5 1 . The first five elements of the first row of the search array 100 of FIG. 1 are q, z, g, w, and s, 
and the first five elements of the first row of the palette array 400 of FIG. 4 are 0, 0, 5, 0, and 2, 
respectively. The first two letters, q and z, are not in the set of target palette values, and thus the 
palette array 400 contains a zero or null value in the corresponding first two locations of the first 
row. The third letter, g, is a member of the set of target palette values, and its palette index is 
five. Thus, the third element of the first row of the palette array 400 is the index value, five. In 
like manner, the fourth and fifth letters, w and s, map to palette array values of zero (w is not a 
palette value) and two (s is the second value in the target palette). Algorithmically, this process 
is defined as: 

if Vjj is a member of the palette values, 
thenp^m, 
elsepy = 0; 

where V y is the value of the data item at the ij location of the search dataset (100 in FIG. 1), p r 
is the value of the corresponding ij entry in the palette array, and m is the index of the palette 
value corresponding to the search value V ; . 

'J 



A search for the target within the image is effected by processing the image palette 400. 
The image palette 400 is searched for a region of the same size as the target that has a similar 
histogram vector to the target histogram vector, via the loop 350-380 of FIG. 3. As would be 
evident to one of ordinary skill in the art, regions that differ in size from the target histogram 
vector may be used, provided the histograms are normalized for the differences in size. Equal 
target and region sizes are assumed hereinafter, for ease of reference. 

Because the index to the palette values that are used to create the target histogram vector 
is associated with each element in the image palette 400, the creation of the histogram vector for 
the region of the search dataset merely requires an accumulation of the number of times each 
index appears in the region of the search dataset. A region 410 of FIG. 4, for example, which 
includes three occurrences of palette-index- 1, one occurrence of palette-index-2, two 
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occurrences of palette-index-3, and so on, has a histogram vector of (3,1,2,1,1,0,1) 415. In like 
manner, a region 420 has a histogram vector of (3,3,2,2,1,0,1) 425. The histogram vector for 
each target-sized region of the palette array is determined, at 360 in FIG. 3, and compared to the 
target histogram vector, at 370. Preferably, the following equation is used to determine the 
5 similarity between two histogram vectors: 

S = f,mm(hR k ,hT k ), 

where hR is the histogram vector of the region, hT is the histogram vector of the target, and N is 
the length, or number of dimension, in each histogram vector. In the example, the similarity of 
vector 415 of FIG. 4 (3,1,2,1,1,0,1) to the target vector 250 of FIG. 2 (4,3,2,2,1,1,1) is: 
S = min(3,4)+min( 1 ,3)+min(2,2)+min( 1 ,2)+min( 1 , 1 )+min(0, 1 )+min( 1,1) 
S = 3+1+2+1+1+0+1 =9. 
The similarity of vector 425 (3,3,2,2,1,0,1) to the target vector 250 (4,3,2,2,1,1,1) is: 

S = min(3,4)+min(3,3)+min(2,2)+min(2,2)+min(l,l)+min(0,l)+min(l,l) 
S = 3+3+2+2+1+0+1 = 12. 

Based on these two similarity measures, region 420 (corresponding to vector 425) is determined 
to be more similar to the target than region 410. This similarity measure can be normalized to a 
0-1 range, based on the maximum possible value, which is the sum of the components of the 
target histogram vector hT. In this example, the maximum value is 14, and thus the region 420 is 
determined to have a normalized similarity measure of 12/14, or approximately 0.86, compared 
to a similarity measure of approximately 0.63 for region 410. 

Note that a histogram vector must be determined for each different target-size region in 
the search set, or palette 400. In accordance with another aspect of this invention, each region's 
histogram vector is determined based on the histogram vector of its immediate neighbor. 
Consider, for example, vertically adjacent target-sized regions 450 and 460 in FIG. 4, region 460 
being the region defined by the dashed outline. The histogram vector of region 450 is 
(1,1,0,2,0,1,1). The histogram vector of region 460 can be determined by decrementing the 
vector value of the index associated with the set of pixels in the first row of the region 450 
(corresponding to the pixels that are in region 450 but not in region 460), and incrementing the 
vector value of the index associated with the set of pixels in the row below the region 450 
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(corresponding to the pixels in region 460 that are not in region 450). In this example, the vector 
value of the seventh palette color is decremented, and the vector value of the fourth palette color 
is incremented, giving a histogram vector of region 460 of (1,1,0,2+1,0,1,1-1) = (1,1,0,3,0,1,0). 
In like manner, a histogram of a horizontally adjacent region to another region can be 
5 determined by decrementing the vector values of the indexes in the vertical column that is 
'deleted' from the other region and incrementing the vector values of the indexes in the vertical 
column that is 'added' to the other region to form the horizontally adjacent region. Thus, after 
determining the first target-sized region's histogram vector, each subsequent histogram vector 
can be determined by a limited number of increment or decrement operations. 
1 0 FIGs. 5 A and 5B illustrate a further efficiency improvement for the recursive 

determination of search histogram vectors for overlapping regions of the search array in 
q accordance with this invention. This embodiment is particularly well suited for systems that are 
configured to effect vector operations directly, such as systems that contain MMX technology. 
4= The example target region 450 is shown in FIG. 5A; for ease of reference, indexes i j are 

fl£ used to identify the row (i) and column (j) of each element. The index i j at the upper left of the 
region is used to identify the region. Region 460 in FIG. 5B, for example, is identified as region 
(i+1 j), being in the same column (j) as region 450, but in the next lower row (i+1). 
M> A histogram vector h h is illustrated corresponding to each row in the region 450. The 

p vector h h (i j) 45 1 corresponds to the target-sized row that begins at (i j); h h (i+ 1 j) 452 as the row 
2D beneath it, and so on. The histogram vector 455 of region 450 is illustrated as the vector sum of 
each of the row vectors, and identified as h/ij). 

Given that the histogram vector h/i j) exists, the histogram vector h/i+lj), 
corresponding to region 460, can be computed by merely subtracting the histogram vector 
corresponding to the uppermost row of the region, h h (ij) 45 1, and adding the histogram vector 
25 corresponding to the row beneath the region 450, ^(i+4 j) 464, as illustrated in FIG. 5B. 
Algorithmically: 

K 0' + 1, J) = K (i, j) + h h (i + w, j) - h h (i, j) , 
where w is the number of rows of the region. Note that a single vector addition and subtraction 
provides for a determination of a histogram vector corresponding to a region, independent of the 
30 vertical size, w, of the region. The speed of the vector addition and subtraction is dependent 
upon the number of dimensions in the vector, which, in this application is the number of palette 
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values. The number of palette values determines the discriminating capabilities of the above 
described similarity measure, and thus, in accordance with this invention, the speed of the 
process is directly related to the discriminating capabilities of the histogram comparison process 
of this invention. 

5 The above equation is applied to determine the histogram vector corresponding to each 

region in a vertical column (j). In a preferred embodiment of this invention, a recursive 
technique is also applied in the processing of adjacent vertical columns (j+1). As noted above, 
horizontal row vectors h h are used to compute a region histogram vector in a time that is 
substantially independent of the vertical size of the region. In like manner, each horizontal row 
1 0 vector h h is determined in a time that is substantially independent of the horizontal size of the 
region, as follows: 

o 

W 

Ul K (*> J + 1)[/K*. j + /»)] + + 

P h„(i,J + mp(iJ)] — , 

W 

where p(x,y) is the palette index value at the x,y location in the palette array, and m is the width 
o of the region. The above equations are written in the "C" language, and correspond to setting the 
jjj adjacent (i j+ 1 ) histogram vector equal to the prior (i j) histogram vector; incrementing the p* 
g vector element in the histogram vector, where p is the content of the palette array element 
M= (ij+m) that is immediately adjacent the last horizontal element of the row of the region (ij); and 
20 decrementing the p* vector element in the histogram vector, where p is the content of the first 

palette array element (i j) of the row of the region (ij). In this manner, only a single increment 

and decrement operation is required to determine each row vector h h , regardless of the horizontal 

size, m, of the region. 

An example of this recursive determination of a row vector is illustrated with regard to 
25 the determination of the row vector for the elements (0- 1 -6- 1 ) within the dashed outline 472 of 
FIG. 4, which is immediately adjacent to the second palette row (4-0-1-6) of region 450. The 
row vector 452 (1,0,0,1,0,1,0) corresponding to this second palette row (4-0-1-6) is illustrated in 
FIG. 5A. To determine the vector corresponding to row 472 from the row vector 452 of the prior 
adjacent row, the vector value is first set equal to the prior vector 452 (1,0,0,1,0,1,0). Then, the 
vector element corresponding to the palette value of the added palette element (" 1 ") 473 is 
incremented (1+1,0,0,1,0,1,0), and the vector element corresponding to removed palette element 
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("4") 474 is decremented (1+1,0,0,1-1,0,1,0) to form a vector (2,0,0,0,0,1,0) corresponding to the 
desired element row (0-1-6-1) 472. Note that, because only the first and last-plus-one palette 
values of the prior row are required in this determination, this single increment and decrement 
process is applicable regardless of the width of the element row. 

Similar equations can be derived for other structures than a two-dimensioned (i,j) array 
forming the target and search datasets, as would be evident to one of ordinary skill in the art in 
view of this disclosure. For example, a one-dimensioned (linear) array corresponds to a constant 
(unity) value of j, and the recursive determination of adjacent horizontal rows, described 
immediately above, can be used alone, thereby requiring a single increment and decrement 
operation to determine each histogram vector. 

Preferably, this invention is embodied as a computer program that is executable on a 
processing system, with ancillary hardware elements, such as image and vector processors being 
controlled by the processing system. Other embodiments will be evident to one of ordinary skill 
in the art. For example, the palette array 400 may be created in a memory by a software routine 
that also determines the target vector and a first region vector. An "application-specific" 
integrated circuit, such as a programmable gate array, can be provided that is configured to 'step- 
through' the memory, comparing the target vector to the region vector, and applying the 
recursive algorithms defined above to determine each subsequent region vector, and repeating 
the process until each of the regions have been evaluated. 

The foregoing merely illustrates the principles of the invention. It will thus be 
appreciated that those skilled in the art will be able to devise various arrangements which, 
although not explicitly described or shown herein, embody the principles of the invention and 
are thus within the spirit and scope of the following claims. 
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